NSF PAR Search | NSF Public Access Repository


Extremum Seeking and Adaptive Dynamic Programming for Distributed Feedback Optimization

https://doi.org/10.1109/CDC56724.2024.10886832

Liu, Tong; Krstić, Miroslav; Jiang, Zhong-Ping (December 2024, IEEE)

This paper studies the distributed feedback optimization problem for linear multi-agent systems without precise knowledge of local costs and agent dynamics. The proposed solution is based on a hierarchical approach that uses upper-level coordinators to adjust reference signals toward the global optimum and lower-level controllers to regulate agents’ outputs toward the reference signals. In the absence of precise information on local gradients and agent dynamics, an extremum-seeking mechanism is used to enforce a gradient descent optimization strategy, and an adaptive dynamic programming approach is taken to synthesize an internal-model-based optimal tracking controller. The whole procedure relies only on measurements of local costs and input-state data along agents’ trajectories. Moreover, under appropriate conditions, the closed-loop signals are bounded and the outputs of the agents converge exponentially to a small neighborhood of the desired extremum. A numerical example validates the efficacy of the proposed method.
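As a rough illustration of the extremum-seeking idea mentioned in the abstract above, the following minimal sketch performs gradient descent on a scalar cost using only cost measurements. It is not the paper's distributed algorithm: the single-agent setting, the quadratic cost, the function name `extremum_seek`, and all gains (`a`, `omega`, `k`, `dt`, `t_end`) are assumptions chosen for the example.

```python
import math


def extremum_seek(cost, theta0, a=0.5, omega=100.0, k=0.2, dt=1e-3, t_end=40.0):
    """Scalar perturbation-based extremum seeking, integrated with forward Euler.

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    theta_hat = theta0
    steps = int(t_end / dt)
    for n in range(steps):
        s = math.sin(omega * n * dt)       # sinusoidal dither signal
        y = cost(theta_hat + a * s)        # only the measured cost is used
        grad_est = (2.0 / a) * y * s       # its average over a dither period approximates dJ/dtheta
        theta_hat -= dt * k * grad_est     # gradient-descent step on the estimate
    return theta_hat


# Hypothetical cost with its minimum at theta = 2; the seeker never sees the gradient.
theta_star = extremum_seek(lambda th: (th - 2.0) ** 2, theta0=0.0)
print(theta_star)
```

The dither frequency is chosen much larger than the adaptation gain so that the oscillating terms average out; the estimate settles in a small neighborhood of the minimizer rather than exactly on it.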
Full Text Available
Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach

https://doi.org/10.1016/j.automatica.2024.111944

Cui, Leilei; Pang, Bo; Krstić, Miroslav; Jiang, Zhong-Ping (January 2025, Automatica)

This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP) to obtain a data-driven method that learns the near-optimal controller without precise knowledge of the system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the advantages of the proposed VI algorithm are that it is developed directly for continuous-time systems without discretization and that it does not require an initial admissible controller. The efficacy of the proposed methodology is demonstrated by two practical examples: metal cutting and autonomous driving.
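To give a feel for value iteration on a Riccati equation, here is a minimal sketch for a much simpler setting than the one in the abstract above: a scalar, delay-free system with known dynamics. It is a model-based analogue under stated assumptions, not the paper's data-driven algorithm for the infinite-dimensional case; the function name `riccati_value_iteration`, the fixed step size, and the numerical values are all hypothetical.

```python
import math


def riccati_value_iteration(a, b, q, r, step=0.1, iters=200):
    """Iterate p <- p + step * (2*a*p + q - (b*p)**2 / r) for the scalar
    system x_dot = a*x + b*u with cost integral of q*x**2 + r*u**2.

    Assumption: a small fixed step; the literature typically uses a
    diminishing step sequence instead.
    """
    p = 0.0  # start from zero: no stabilizing initial controller is needed
    for _ in range(iters):
        residual = 2.0 * a * p + q - (b * p) ** 2 / r  # Riccati equation residual
        p += step * residual
    return p


# Open-loop unstable example (a > 0), chosen for illustration only.
p = riccati_value_iteration(a=1.0, b=1.0, q=1.0, r=1.0)
gain = 1.0 * p / 1.0  # optimal feedback u = -(b*p/r) * x
print(p, 1.0 + math.sqrt(2.0))
```

For these values the algebraic Riccati equation reads 2p + 1 - p^2 = 0, whose positive root is 1 + sqrt(2), so the iterate can be checked against a closed-form answer even though the open-loop system is unstable.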
Full Text Available
Deception in Nash Equilibrium Seeking

https://doi.org/10.1109/TAC.2025.3582524

Tang, Michael; Javed, Umar; Chen, Xudong; Krstić, Miroslav; Poveda, Jorge I (December 2025, IEEE Transactions on Automatic Control)

Full Text Available
Fixed-Time Input-to-State Stability for Singularly Perturbed Systems via Composite Lyapunov Functions

https://doi.org/10.1109/TAC.2025.3631559

Tang, Michael; Krstić, Miroslav; Poveda, Jorge I (January 2025, IEEE Transactions on Automatic Control)

Full Text Available
Constrained Control of Input Delayed Systems With Partially Compensated Input Delays

https://doi.org/10.1115/DSCC2020-3271

Abel, Imoleayo; Janković, Mrdjan; Krstić, Miroslav (October 2020, ASME 2020 Dynamic Systems and Control Conference)
Control Barrier Functions (CBFs) have become popular for enforcing, via barrier constraints, the safe operation of nonlinear systems within an admissible set. For systems with input delay(s) of the same length, constrained control has been achieved by combining a CBF for the delay-free system with a state predictor that compensates the single input delay. Recently, this approach was extended to multi-input systems with input delays of different lengths. One limitation of this extension is that barrier constraint adherence can only be guaranteed after the longest input delay has been compensated and all input channels become available for control. In this paper, we consider the problem of enforcing constraint adherence when only a subset of input delays has been compensated. In particular, we propose a new barrier constraint formulation that ensures that, when possible, a subset of input channels with shorter delays is used to keep the system in the admissible set even before longer input delays have been compensated. We include a numerical example to demonstrate the effectiveness of the proposed approach.
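The barrier-constraint idea that the abstract above builds on can be sketched for the simplest delay-free case. The example below is only an assumption-laden illustration, not the paper's formulation for partially compensated input delays: it uses a scalar integrator x_dot = u, the hypothetical barrier h(x) = x_max - x, and made-up names (`safe_input`, `simulate`) and parameter values.

```python
def safe_input(x, u_nom, x_max=1.0, alpha=2.0):
    """Minimally modify u_nom so that h(x) = x_max - x stays nonnegative.

    With x_dot = u, the barrier condition h_dot + alpha * h >= 0 reduces to
    u <= alpha * (x_max - x), so the safety filter is a simple clipping.
    """
    return min(u_nom, alpha * (x_max - x))


def simulate(x0=0.0, u_nom=1.0, dt=0.01, steps=1000):
    """Forward-Euler simulation of the filtered closed loop (illustrative values)."""
    x, xs = x0, [x0]
    for _ in range(steps):
        x += dt * safe_input(x, u_nom)
        xs.append(x)
    return xs


trajectory = simulate()
print(max(trajectory))
```

The nominal input pushes the state toward the boundary at constant speed; the filter leaves it untouched while the constraint is inactive and slows the approach near x_max, so the state stays inside the admissible set.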
Full Text Available
